Processing received data

ABSTRACT

A method for controlling the processing of data in a data processor such that the data processor is connectable to a further device over a data link. The method comprising the steps of receiving data at an element of the data processor and if a set interval has elapsed following the receipt of the data, determining whether processing of the received data in accordance with a data transfer protocol has begun, and, if it has not, triggering such processing of the received data by a protocol processing element. The method then senses conditions pertaining to the data link and sets the interval in dependence on the sensed conditions.

1. PRIORITY CLAIM

This application is a continuation of and claims priority to U.S. patentapplication Ser. No. 13/633,830, filed Oct. 2, 2012, which is acontinuation of and claims priority to U.S. patent application Ser. No.12/215,437 filed Jun. 26, 2008 now issued as U.S. Pat. No. 8,286,193 onOct. 9, 2012, which is a continuation of and claims priority to PCTApplication No. PCT/GB2006/004946 filed on Dec. 28, 2006, which claimspriority to Great Britain Application No. 0526519.4 filed on Dec. 28,2005.

2. FIELD OF THE INVENTION

This invention relates to a method and apparatus for controlling theprocessing of data in a data processor.

3. BACKGROUND OF THE INVENTION

In data networks it is important to enable efficient and reliabletransfer of data between devices. Data can only reliably be transferredover a connection between two devices at the rate that a bottleneck inthe connection can deal with. For example, a switch in a TCP/IPconfigured connection may be able to pass data at a speed of 10 Mbpswhile other elements of the connection can pass data at, say, 100 Mbps.The lowest data rate determines the maximum overall rate for theconnection which, in this example, would be 10 Mbps. If data istransmitted between two devices at a higher speed, packets will bedropped and will subsequently need to be retransmitted. If more than onelink is combined over a connector such as a switch, then the bufferingcapacity of the connector needs to be taken into account in determiningmaximum rates for the links, otherwise data loss could occur at theconnector.

FIG. 1 shows the architecture of a typical networked computing unit 1.Block 6 indicates the hardware domain of the computing unit. In thehardware domain the unit includes a processor 2 which is connected to aprogram store 4 and a working memory 5. The program store stores programcode for execution by the processor 2 and could, for example, be a harddisc. The working memory could, for example, be a random access memory(RAM) chip. The processor is connected via a network interface card(NIC) 2 to a network 10. Although the NIC is conventionally termed acard, it need not be in the form of a card: it could for instance be inthe form of an integrated circuit or it could be incorporated into thehardware that embodies the processor 2. In the software domain thecomputing unit implements an operating system 8 which supports anapplication 9. The operating system and the application are implementedby the execution by the processor 3 of program code such as that storedin the program store 4.

When the computing unit receives data over the network, that data mayhave to be passed to the application. Conventionally the data does notpass directly to the application. One reason for this is that it may bedesired that the operating system polices interactions between theapplication and the hardware. As a result, the application may berequired to interface with the hardware via the operating system.Another reason is that data may arrive from the network at any time, butthe application cannot be assumed always to be receptive to data fromthe network. The application could, for example, be de-scheduled orcould be engaged on another task when the data arrives. It is thereforenecessary to provide an input mechanism (conventionally an input/output(I/O) mechanism) whereby the application can access received data.

FIG. 2 shows an architecture employing a standard kernel TCP transport(TCPk). The operation of this architecture is as follows.

On packet reception from the network, interface hardware 101 (e.g. aNIC) transfers data into a pre-allocated data buffer (a) and invokes aninterrupt handler in the operating system (OS) 100 by means of aninterrupt line (step i). The interrupt handler manages the hardwareinterface. For example, it can indicate available buffers for receivingdata by means of post( ) system calls, and it can pass the receivedpacket (for example an Ethernet packet) and identify protocolinformation. If a packet is identified as destined for a valid protocole.g. TCP/IP it is passed (not copied) to the appropriate receiveprotocol processing block (step ii).

TCP receive-side processing then takes place and the destination port isidentified from the packet. If the packet contains valid data for theport then the packet is engaged on the port's data queue (step iii) andthe port is marked as holding valid data. Marking could be performed bymeans of a scheduler in the OS 100, and it could involve awakening ablocked process such that the process will then respond to the presenceof the data.

In some circumstances the TCP receive processing may require otherpackets to be transmitted (step iv), for example where previouslytransmitted data needs to be retransmitted or where previously enqueueddata can now be transmitted, perhaps because the TCP transmit window(discussed below) has increased. In these cases packets are enqueuedwith the OS Network Driver Interface Specification (“NDIS”) driver 103for transmission.

In order for an application to retrieve data from a data buffer it mustinvoke the OS Application Program Interface (API) 104 (step v), forexample by means of a call such as recv( ), select( ) or poll( ). Thesecalls enable the application to check whether data for that applicationhas been received over the network. A recv( ) call initially causescopying of the data from the kernel buffer to the application's buffer.The copying enables the kernel of the OS to reuse the buffers which ithas allocated for storing network data, and which have specialattributes such as being DMA accessible. The copying can also mean thatthe application does not necessarily have to handle data in unitsprovided by the network, or that the application needs to know a priorithe final destination of the data, or that the application mustpre-allocate buffers which can then be used for data reception.

It should be noted that on the receive side there are at least twodistinct threads of control which interact asynchronously: the up-callfrom the interrupt and the system call from the application (describedin co-pending application WO2005/074611). Many operating systems willalso split the up-call to avoid executing too much code at interruptpriority, for example by means of “soft interrupt” or “deferredprocedure call” techniques.

The send process behaves similarly except that there is usually one pathof execution. The application calls the operating system API 104 (e.g.using a send( ) call) with data to be transmitted (step vi). This callcopies data into a kernel data buffer and invokes TCP send processing.Here protocol is applied and fully formed TCP/IP packets are enqueuedwith the interface driver 103 for transmission.

If successful, the system call returns with an indication of the datascheduled (by the hardware 101) for transmission. However there are anumber of circumstances where data does not become enqueued by thenetwork interface device. For example the transport protocol may queuepending acknowledgements from the device to which it is transmitting, orpending window updates (discussed below), and the device driver 103 mayqueue in software pending data transmission requests to the hardware101.

A third flow of control through the system is generated by actions whichmust be performed on the passing of time. One example is the triggeringof retransmission algorithms. Generally the operating system 100provides all OS modules with time and scheduling services (typicallydriven by interrupts triggered by the hardware clock 102), which enablethe TCP stack to implement timers on a per-connection basis. Such ahardware timer is generally required in a user-level architecture, sincethen data can be received at a NIC without any thread of an applicationbeing aware of that data. In addition to a hardware timer of this type,timers can be provided (typically in software) to ensure that protocolprocessing advances.

The setting of a software timer for ensuring the advance of protocolprocessing can impact on the efficiency of data transfer over thenetwork. The timer can for example be instructed by a transport protocollibrary of the application to start counting when a new packet isdelivered from the NIC to the transport protocol library. On expiry of atimeout, the timer causes an event to be delivered to an event queue inthe kernel, for example by issuing an event from the NIC 101, the eventidentifying an event queue in the OS. At the same time as the event isdelivered, an interrupt is scheduled to be delivered to the OS.According to the interrupt moderation rules in force, an interrupt israised and the OS will start to execute device driver code to processevents in the event queue. Thus, the software timer can be arranged totrigger protocol processing of data received at the data processor overthe network, or to trigger protocol processing of data for transmissionover the network. Such a timer preferably causes the kernel to beinvoked relatively soon (for example within 250 ms) after the receipt ofdata at the NIC.

FIG. 3a illustrates a conventional synchronous I/O mechanism. Anapplication 32 running on an OS 33 is supported by a socket 50 and atransport library 36. The transport library has a receive buffer 51allocated to it. The buffer could be an area of memory in the memory 5shown in FIG. 1. When data is received by the NIC 31 it writes that datato the buffer 51. When the application 32 wants to receive the data itissues a receive command (recv) to the transport library via the socket50. In response, the transport library transmits to the application amessage that includes the contents of the buffer. This involves copyingthe contents of the buffer into the message and storing the copiedcontents in a buffer 52 of the application. In response to obtainingthis data, the application may cause messages to be issued, such as anacknowledgement to the device which transmitted the data. A problem withthis I/O mechanism is that if the application fails to service thebuffer often enough then the buffer 51 can become full, as a consequenceof which no more data can be received.

FIG. 3b illustrates a conventional asynchronous I/O mechanism. Thismechanism avoids the overhead of copying the data by transferringownership of buffers between the transport library and the application.Before data is to be received, the application 32 has a set of buffers(B₁-B₃) allocated to it. It then passes ownership of those buffers tothe transport library 36 by transmitting to the transport library one ormore post( ) commands that specify those buffers. When data is receivedit is written into those buffers. When the application wants to accessthe data it takes ownership of one or more of the buffers back from thetransport library. This can be done using a gather( ) command thatspecifies the buffers whose ownership is to be taken back. Theapplication can then access those buffers directly to read the data. Aproblem with this I/O arrangement is that the amount of data that iscollected when the gather( ) command is executed could be very large, ifa large amount of buffer space has been allocated to the transportlibrary, and as a result the application may need considerable time toprocess that data.

Thus, with both of these mechanisms problems can arise if theapplication services the buffers at too fast or too slow a rate. If thebuffers are serviced too infrequently then they can become full (inwhich case the reception of data must be suspended) or the amount ofdata that is returned to the application when the buffers are servicedcould be very large. However, if the buffers are serviced too frequentlythen there will be excessive communication overheads between theapplication and the transport library as messages are sent between thetwo. One way of addressing these problems is to arrange for thetransport library to set a timer that, on reaching a timeout, triggersthe operating system to assist in processing any received data. This isparticularly useful in the case of a user-level network architecture,where the transport library is normally driven by synchronous I/O callsfrom the application. The timer could, for example, run on the NIC. Thismechanism can improve throughput but it has the disadvantage that itinvolves interrupts being set to activate the operating system toprocess the data. Processing interrupts involves overhead, and there mayalso only be a limited number of interrupts available in the system.

There is therefore a need for a mechanism which can increase theefficiency with which data can be protocol processed.

SUMMARY

According to the present invention there is provided a method forcontrolling the processing of data in a data processor, the dataprocessor being connectable to a further device over a data link, themethod comprising the steps of: receiving data at an element of the dataprocessor; if a set interval has elapsed following the receipt of thedata, determining whether processing of the received data in accordancewith a data transfer protocol has begun, and, if it has not, triggeringsuch processing of the received data by a protocol processing element;sensing conditions pertaining to the data link; and setting the intervalin dependence on the sensed conditions.

The data processor may be connectable to the data link by means of aninterface. A timer could reside on the interface, and the said intervalcould suitably be measured by the timer. The interface could suitably beimplemented in hardware.

The step of determining may comprise determining whether processing ofthe received data by code at user level has begun.

The said protocol processing element is preferably an operating system.

The received data could comprise data received at the data processorover the data link, optionally by means of an asynchronous transmission.

The received data could also comprise data to be transmitted over thedata link, optionally by means of an asynchronous transmission over thedata link.

The step of triggering processing may comprise issuing an interrupt,preferably to the operating system.

The said element could be a transport library associated with anapplication running on the data processor.

The method could further comprise the step of in response to receivingthe data, sending an instruction from the said element to the timer. Thestep of sending an instruction to the timer could comprise triggeringthe timer directly from the said element via a memory mapping onto thesaid interface.

The step of setting the interval may comprise reducing the interval ifthe sensed conditions are indicative of an increase in data rate overthe data link.

Buffer space could be allocated to the data link for storing datareceived at the data processor over the data link, and the protocolcould be a protocol that employs a receive window in accordance withwhich a transmitter of data according to the protocol will transmit nofurther traffic data once the amount of data defined by the receivewindow has been transmitted and is unacknowledged by the receiver, andthe step of setting the interval could comprise reducing the interval inresponse to sensing that the size of the buffer space allocated to thedata link is greater than the size of the receive window. The methodcould also comprise the step of varying the size of the buffer spaceallocated to the data link in response to a request from a consumer ofthe traffic data. The consumer could be an application running on thedata processor.

The step of sensing conditions could comprise sensing the presence in atransmit buffer of data to be transmitted over the data link.

The step of setting the interval could comprise reducing the interval inresponse to sensing in a transmit buffer data to be transmitted over thedata link.

The step of setting the interval could comprise reducing the interval inresponse to sensing that a congestion mode of the protocol is inoperation over the data link.

The protocol could suitably be TCP.

According to a second aspect of the present invention there is providedapparatus for controlling the processing of data in a data processor,the data processor being connectable to a further device over a datalink, the apparatus comprising: an element arranged to receive data; anda control entity arranged to, if a set interval has elapsed followingthe receipt of data, determine whether processing of the received datain accordance with a data transfer protocol has begun, and, if it hasnot, trigger such processing of the received data by a protocolprocessing element; wherein the control entity is further arranged tosense conditions pertaining to the data link and set the interval independ dependence on the sensed conditions.

According to a third aspect of the present invention there is provided acontrol entity for use with a data processor, the data processor beingconnectable to a further device over a data link, and comprising anelement arranged to receive data, the control entity being arranged to:if a set interval has elapsed following the receipt of data by the saidelement, determine whether processing of the received data in accordancewith a data transfer protocol has begun, and, if it has not, triggersuch processing of the received data by a protocol processing element;and sense conditions pertaining to the data link and set the interval independence on the sensed conditions.

The present invention will now be described by way of example withreference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

The components in the figures are not necessarily to scale, emphasisinstead being placed upon illustrating the principles of the invention.In the figures, like reference numerals designate corresponding partsthroughout the different views.

FIG. 1 shows the architecture of a computing system.

FIG. 2 is a schematic representation of a prior art data transferarchitecture.

FIG. 3a shows a data processing system arranged for synchronous datatransfer.

FIG. 3b shows a data processing system arranged for asynchronous datatransfer.

FIG. 4 illustrates a typical flow of data transmitted in accordance withthe TCP protocol.

FIG. 5 shows schematically a pair of data processing devicescommunicating over a data link.

DETAILED DESCRIPTION OF THE INVENTION

In the present system the entity that is to process received data, whichcould for example be an application or part of the operating system, iscapable of varying the minimum threshold frequency with which itservices the receive buffer(s) in dependence on conditions that may betaken as indicative of changes in the rate at which data is beingreceived. In this way the entity can service the buffers more quicklywhen data is expected to be received in greater volume, and more slowlywhen little data is expected to be received.

TCP is a common example of a transport protocol, and a discussion of theTCP windows technique will now be given. According to the TCP protocol,each time a certain amount of data is transmitted from one device toanother over a network, the transmitting device must await anacknowledgement from the receiving device before sending further data. ATCP window is the amount of unacknowledged data a sender can send on aparticular connection before it must await an acknowledgement from thereceiver. To give a simple example, the window could be specified as 10octets (or bytes). Thus, once 10 bytes have been sent by a sender, nofurther transmission will be permitted until the sender receives anacknowledgement from the receiver that at least some of those 10 byteshave been received. If the acknowledgement indicates, for example, thatall 10 bytes have been safely received then the sender is able totransmit a further 10 bytes. If the acknowledgement indicates that onlythe first 2 bytes have been received then the sender may transmit 2further bytes and must then await further acknowledgement.

The TCP window mechanism provides congestion control for a network. Thewindow is generally fixed for a connection according to the propertiesof the bottleneck in the connection at a given time to ensure that datacannot be sent at a greater rate than the bottleneck can handle withoutlosing data.

A receiver in a network advertises a receive window to devices withwhich it has network connections, so that the devices can configurethemselves accordingly. The sender's send window will be set equal tothe receiver's receive window for a particular connection. As anexample, the size of the receive window (in bytes) could simply be thesize of the buffer space on a receiving device's network interface minusthe amount of data currently stored in the buffer.

Window announcements from a receiver can include an acknowledgement ofpreviously sent data. This is an efficient arrangement since it canenable two items of information to be sent in one message.

It is commonly desirable for transmission and reception over a networkto be in a steady-state condition whereby packets are periodically sentfrom a sender and acknowledgements (which may be accompanied by windowannouncements) are periodically sent from a receiver. This is anefficient mode of data transfer since data packets can be continually intransit over the network.

The window size for a particular network connection can be varied inresponse to changes within the connection; for example, it could beincreased if more buffer space becomes available at the receiver.Changes in window size are indicated to a sender by means of windowannouncements.

The flow rate of data according to the TCP protocol may be ramped, asillustrated in FIG. 4. In general, the receive window will initially bedefined to be relatively small. If a transmitting application has alarge amount of data to be sent, evidenced by the fact that the receiveris receiving data at the maximum rate permitted by thewindow/acknowledgement technique, in other words if the connection is“busy”, then the receiver may wish to permit a greater rate oftransmission over the connection by increasing its receive window.Typically, the window will be increased in a stepwise manner for as longas the transmitter continues to transmit at the maximum rate and untilpacket drop is detected. When packet drop is detected, the window willbe decreased (typically halved) in order to avoid further loss of data,and this will be advertised in the next window announcement sent by thereceiver. In the exemplary flow shown in FIG. 4, the shaded regions arewhere flow rate is increasing and thus fast window increase is desirableso that the transfer rate can be increased accordingly.

It was noted above that the TCP window may be halved suddenly whenpacket drop is detected. Packet drop may be detected as an absence of anacknowledgement from the receiving device at the transmitting device.Typically, after a predetermined time interval, if no acknowledgement isreceived at the transmitter then the transmitter will commenceretransmission of the packets which have not yet been acknowledged. Theabsence of an acknowledgement may be due to data actually having beenlost over the network connection, or it may be due to a delay in theconnection such that the data has not been received at the receiverwithin the predetermined time interval, causing the transmitter totimeout and commence retransmission.

Considering transmission from a transmitting device, if a send queueassociated with a transmitting application is not permitted adequate CPUtime then protocol processing of data waiting in the queue cannot occursufficiently fast for the send queue to be emptied and the data to besent to the network interface for transmission over the network. If datais awaiting processing and no data is ready to be sent then it ispossible that the connection may go idle. Additionally, send queues maybecome full such that they cannot accept new data. Similarly, for areceiving device it is important that incoming data can be processed atthe rate at which it is being received. Regular and timely processing ofbuffered incoming data is therefore important. In general, timelyprotocol processing of waiting data is desirable to achieve efficientdata transfer over a data link. This is especially true for asynchronoustransmission, since large chunks of data tend to be delivered, whereasin synchronous transmission the chunks tend to be smaller.

Additionally, the overall transfer rate of data can be improved byenabling an increasing window to reach its maximum size as quickly aspossible. This can be achieved by dedicating more CPU time to flowswhich are accelerating than to flows which are constant or increasing.

In the discussion of FIG. 2 above, it was noted that timers may be usedto ensure that queues of data are permitted regular attention. Inrespect of issuing interrupts following timeouts, the inventors of thepresent invention have appreciated that a long timeout can reduce CPUoverhead since it can reduce the number of spurious interrupts. However,a long timeout can be inappropriate in asynchronous transmission modesince it can cause protocol processing to proceed at too slow a rate.

The provision of timeouts for ensuring protocol processing isparticularly important in a system incorporating a user-level protocolstack, such as that described in co-pending PCT application numberPCT/GB05/001525. In such a system protocol processing (such as TCPprocessing) generally takes place within the context of an applicationthread. If the transport library is not given sufficient CPU time (forexample because the thread is performing another operation or isde-scheduled) then by means of a timeout and an interrupt a component ofthe OS can be engaged to carry out the required protocol processing onbehalf of the user-level stack. The question of how to determine anoptimal length for a timeout is thus relevant in this environment. It istherefore proposed by the inventors to provide a means of adjustingtimeouts dynamically according at least to the behaviour of anapplication to or from which data is being sent over the network.

In one instance, it may be desirable to provide a timer at atransmitting device, and to cause the timer to start each time data isdelivered to an element of the device at which protocol processing mustbe performed to enable subsequent transmission of the data over thenetwork. When the timer times out, the operating system or user-levelcode could be triggered to perform protocol processing of the delivereddata. This could ensure that data could flow at an acceptable rate fromthe transmitting device over the network.

In another instance, it may be desirable to provide a timer at areceiving device, and to cause the timer to start each time data isdelivered from the network to an element of the device at which protocolprocessing must be performed for the data to be made available to aconsumer, such as an application running on the receiving device. Inthis instance, when the timer times out, the operating system oruser-level code could be triggered to perform protocol processing of thereceived data. This could ensure that incoming data could be processedat the rate at which it is being received over the network.

The element of the transmitting device could be a transport library orany buffer in which data is stored pending processing to prepare thedata for transmission by a network protocol. The element of thereceiving device could be a transport library or any buffer in whichdata is stored following receipt over a network pending processing toenable the data to be read by a consumer.

When the rate of data transmission over a network is ramping up it isdesirable to ramp up the size of the TCP window as fast as possible inaccordance with the TCP algorithms to keep pace with the amount of datathat needs to be sent so that data does not back up at the transmitter.This can be achieved by reducing the timeout on the timer in thetransmitting device so that data can be sent at the maximum rate (asdefined by the current size of the window), thus triggering windowannouncements as frequently as possible. More generally, it is desirableto respond as quickly as possible to changes in flow rate.

It will be understood that embodiments of the invention can successfullybe applied to both synchronous and asynchronous arrangements.

In general, embodiments of the invention support an algorithm fordetermining an appropriate timeout length for the present conditions inthe connection. The conditions could include the presence or recentoccurrence of congestion, the amount of data which an application wishesto send or receive, the amount of other activity in the network, or anyother aspect of the network which is relevant to the flow of data over aconnection.

Depending on the detected conditions, it could be appropriate to modifythe length of the timeout at the receiver or at the transmitter, or atboth. The timer could suitably be implemented in hardware, for exampleat a network interface, or it could be implemented in software. Thetimer could suitably be arranged to receive instructions to start timingfrom an element of a data processor at which protocol processing isrequired. The element could suitably be a transport library.

In one embodiment, a mechanism can be implemented whereby events issuedby a transport library can be batched together such that only oneinterrupt is raised for multiple events. This can avoid a high overheadon the CPU.

FIG. 5 illustrates a pair of data processing devices communicating overa data link. In the present example device 20 is transmitting data andthe other device 30 is receiving that data. However, the devices couldcommunicate bi-directionally. The devices could be in directcommunication or could communicate indirectly over a network (e.g. viaone or more routers).

In this example the protocol in use by the devices is TCP (transmissioncontrol protocol) over Ethernet, but the present invention is suitablefor use with other protocols.

Device 20 comprises a data store 21 that stores the data that is to betransmitted. Under the control of an application 22 supported by anoperating system 23 and running on a processing unit 24 data from thedata store 21 is passed to a NIC 25 for transmission to device 30. Oncedata has been passed to the NIC for transmission it waits in a sendqueue. When the NIC is able to transmit more data it takes data from thesend queue, encapsulates it in accordance with the protocols that are inuse, and transmits it. The send queue may be embodied by a dedicatedbuffer for outgoing data, or it may be embodied by the storing by theNIC of the addresses of other buffers from which it is to retrieve datafor transmission. The NIC 25 performs protocol processing either aloneor in conjunction with a transport library 26 running on the processingunit 24.

Device 30 comprises a NIC 31 that is connected to a data link 40 bywhich it can receive data from NIC 25. The data is destined for anapplication 32 that is supported by an operating system 33 and runs on aprocessing unit 34. The device further includes a data store 35 forstoring received data, in which one or more buffers can be defined, anda transport library 36 which comprises a set of processes that run onthe processing unit and can have state associated with them forperforming certain networking functions and for assisting in interfacingbetween the NIC and the application. Co-pending applicationsPCT/IB05/002639 and WO2005/086448 describe examples of such a transportlibrary having such functions. The NIC 31 performs protocol processingeither alone or in conjunction with the transport library 36.

The protocol processing functions that are carried out by thetransmitter and the receiver include implementing flow and congestioncontrol. As is well known, TCP provides a number of flow and congestioncontrol functions. One function involves the TCP receive windowdescribed above.

When the application 32 is to receive data it requests from theoperating system 33 that buffer space is allocated to it for storingthat data in the period between it having been received at the NIC 25and it being accepted by the application for processing. The buffer canbe allocated by the operating system 33 or by the transport library 36.Typically the application will maintain a certain amount of receivebuffer space for normal usage. However, a well-designed applicationwill, if it requests a substantial amount of data over the network orlearns that it is about to be sent a substantial amount of data over thenetwork, request additional receive buffer space to accommodate thatdata.

In this context there are numerous situations that are indicative of apotential or ongoing change in the volume data flow over the linkbetween the devices 20 and 30. These include, but are not limited to:

A sudden increase or decrease in the receive buffer allocated to aparticular application, or in total to all the applications running onthe receiving device. This could indicate a change in flow because, asexplained above, a well-designed application will be expected to modifyits allocated buffer space in anticipation of receiving more or lessdata. It should be noted that the amount of allocated buffer spacetherefore provides an indirect means of signalling between theapplication layer and lower layers of the protocol stack. Such anincrease could be defined by an increase or decrease in allocatedreceive buffer space of more than a predetermined amount or more than apredetermined proportion over a predetermined time. Alternatively, suchan increase could be indicated by the relative sizes of the TCP receivewindow for a link and the posted receive buffer size for the applicationto which that link relates. For example, such an increase could bedeemed to have taken place when the posted receive buffer size exceedsthe TCP receive window.

That the send queue of the transmitter for a particular link is notempty, or has remained not empty for greater than a predetermined periodof time. This may be indicative of congestion on the data link betweenthe transmitter and the receiver. However, it may also be indicative ofdata having been received at the receiver and not having been protocolprocessed so that an acknowledgement for that data can be sent to thetransmitter.

That TCP congestion mode is operating: i.e. that the TCP receive windowis reducing (backing off), or subsequently increasing.

Analogous situations will arise under protocols other than TCP and canbe sensed in an analogous way. For example, any distributed transportprotocol such as SCTP has algorithms similar to the TCP receive windowalgorithm. The sliding window mechanism is commonly used. Otherprotocols (often used in conjunction with hardware) such as Infinibandand some ATM link-layer schemes expose credits to the system. Thethresholds or rate of change of such credits can be used as indicatorsof a change in flow rate.

In the present system the NIC 31 implements one or more timers such astimer 37. These timers consist of a counter that is decrementedperiodically when clocked by a clock 38. The clock could run on the NIC,or the NIC itself could be clocked from the remainder of the dataprocessing device to which it is attached. An initial value is loadedinto the timer. When the timer reaches zero the NIC performs an actionstored in its state memory 41 in relation to that timer.

One use of such timers is for allowing the NIC 31 to signal theoperating system 36 to process data that is waiting in a receive buffer.This will now be described in more detail.

When the device 30 is being configured for reception of data by anapplication running on it, the application will request one or moreareas in memory 35 to be allocated to it for use as receive buffers 39.It then signals the NIC to inform it that the link is to be establishedand to inform it which receiver buffer(s) are to be used with that link.The buffer allocation and the signalling may be done via the operatingsystem 33 and/or the transport library 36.

When data is received over the network by NIC 31, the NIC identifieswhich data link that data relates to. This may be indicated by the portnumber on which the data was received and/or by the source address ofthe data. The NIC then stores the data in a receive buffer correspondingto that data link.

Since the traffic data components of the received data cannot beidentified until protocol processing has been done, the received trafficdata cannot be passed to the receiving application until protocolprocessing has been completed. It is preferred that at least some of theprotocol processing that is to be performed on the received data isperformed downstream of the NIC, for example by the operating system orthe transport library of the receiving device 30. This has been found topermit many potential performance enhancements. The protocol processingthat is performed downstream of the NIC can conveniently include thegeneration of or the triggering of acknowledgement messages for receiveddata. A mechanism whereby this can be done will be described below.

When data has been received by the NIC and written to a receive bufferthe operating system or the transport library is to perform protocolprocessing on that data. Either of these may be instructed by thereceiving application to perform the protocol processing. Alternativelyeither of them may be triggered by a timer that runs separately from theapplication. Having such a timer running separately from the applicationis advantageous because it allows the timer to continue to run even ifthe application becomes de-scheduled. It is preferred that the timer isa timer such as timer 37 running on the NIC. Such a timer can beconfigured (e.g. by the application or the transport library) when thedata link is set up. Conveniently, the transport library signals the NICduring establishment of the link to have the NIC allocate one of itstimers for use with the link. To do this the transport library informsthe NIC of the initial value of the timer and the function that is to beperformed when the timer reaches zero. The initial value is stored bythe NIC in memory 41, and the NIC is arranged to automatically reset thetimer to the value stored in that field when it has reached zero. Thefunction may be to instruct the operating system to perform protocolprocessing for any data received for that link and for which protocolprocessing has not yet been performed. The function is also stored inassociation with the respective timer in memory 41 so that it can berecalled and actioned by the NIC when the timer reaches zero. Once thetimer has been set up it will continually decrement, and then trigger(for example) the operating system to perform protocol processing eachtime it reaches zero.

In the present system at least one of the entities implements analgorithm that manages the interval over which the timer 37 operates.The interval is altered in dependence on sensed conditions, particularlycommunication conditions in the transmitter and/or the receiver for thelink with which the timer is associated. Most preferably, the intervalover which the timer operates is reduced when the entity in questionsenses conditions that are indicative of an increase in data rate. Mostpreferably, the interval over which the timer operates is increased whenthe entity in question senses conditions that are indicative of adecrease in data rate. Some specific examples are as follows.

The interval can be reduced in response to an increase or decrease inthe receive buffer allocated to a particular application, or in total toall the applications running on the receiving device. This may bedetected in the conditions described above, and most preferably when theposted receive buffer size for the link in question exceeds the size ofthe TCP receive window for that link. In the opposite conditions theinterval can be increased.

The interval can be reduced if the transmitter's send queue for the linkin question is not empty, or has remained non-empty for a certain periodof time. In the opposite condition the interval can be increased.

The interval can be increased when congestion mode is operating and/orthe transmission scheme is backing off. In the opposite conditions theinterval can be increased.

Altering the timer interval can most conveniently be done by alteringthe initial value for the timer that is stored in memory 41. The newvalue will be applied to the timer when it is next reset. However, itcould also be advantageous to alter the current value of the timer'scounter. The latter method is useful where the interval over which thetimer operates is being reduced from a high value to a significantlylower value.

The entity that applies the algorithm to cause the timer interval to bealtered could be the NIC, the operating system, the transport library orthe application, or another entity. It is preferred that it is appliedby the transport library since it can communicate directly with the NIC.

The timers on the NIC run outside the scope of scheduling by the mainCPU 34 on the device 30, and outside the scope of scheduling by theoperating system of the device 30. As a result they can be expected torun continually independently of the load or processing demands on thedevice 30.

The event that is triggered by the expiry of the timer could be acompound event in which the operating system is triggered to perform anumber of functions. For example, the entity that executes the algorithmcould be arranged to detect that multiple links are using timers thatare set to the same value, and in response to that to group those linksso as to use a single timer that triggers protocol processing for allthose links using a single interrupt. This saves on interrupts.

The entity that is to perform protocol processing on being triggered bythe timer is preferably arranged so that when it is triggered it checkswhether protocol processing is already being performed on received datafor the respective link, and if it is to not perform protocol processingin response to the trigger. It may conveniently detect whether protocolprocessing is being performed by the presence of a flag that any deviceperforming protocol processing on the system is arranged to sent andunset.

The applicant hereby discloses in isolation each individual featuredescribed herein and any combination of two or more such features, tothe extent that such features or combinations are capable of beingcarried out based on the present specification as a whole in the lightof the common general knowledge of a person skilled in the art,irrespective of whether such features or combinations of features solveany problems disclosed herein, and without limitation to the scope ofthe claims. The applicant indicates that aspects of the presentinvention may consist of any such individual feature or combination offeatures disclosed in this application or any application to whichpriority is claimed. In view of the foregoing description it will beevident to a person skilled in the art that various modifications may bemade within the scope of the invention.

What is claimed is:
 1. A method for controlling the processing of datain a data processor, the data processor being connectable to a furtherdevice over a data link, the method comprising the steps of: receivingdata at the data processor; starting a timer, said timer runningseparately to an application of the data processor; and in response tothe timer indicating that a time interval has passed since the start ofthe timer, determining whether processing of the received data by theapplication of said data processor in accordance with a data transferprotocol has begun, and, in response to determining that the applicationhas not begun the processing of the received data in accordance with thedata transfer protocol, triggering a protocol processing element of anoperating system of said data processor to perform the processing of thereceived data in accordance with the data transfer protocol; wherein abuffer space having a size is allocated to the data link for storingdata received at the data processor over the data link, the data linkhaving a protocol that employs a receive window in accordance with whicha transmitter of data according to the protocol will transmit no furthertraffic data once an amount of data defined by the receive window hasbeen transmitted and is unacknowledged by the data processor, such thata size of the receive window is modified to a new size during operation.2. A method according to claim 1 wherein the data processor isconnectable to the data link by means of an interface.
 3. A methodaccording to claim 2 wherein the timer resides on the interface.
 4. Amethod according to claim 2 further comprising the step of in responseto receiving the data, sending an instruction to the timer.
 5. A methodaccording to claim 4 wherein the step of sending an instruction to thetimer comprises triggering the timer directly via a memory mapping.
 6. Amethod according to claim 1 wherein the received data comprises datareceived at the data processor by means of an asynchronous transmissionover the data link.
 7. A method according to claim 1 wherein thereceived data comprises data to be transmitted by means of anasynchronous transmission over the data link.
 8. A method according toclaim 1 wherein the step of triggering the protocol processing elementto perform the processing comprises issuing an interrupt.
 9. A methodaccording to claim 8 wherein the step of triggering the protocolprocessing element to perform the processing comprises issuing aninterrupt to the operating system of the data processor.
 10. The methodaccording to claim 1 further comprising: in response to a modificationof the size of the receive window, setting with the data processor thetime interval in dependence on the size of the buffer space allocated tothe data link and a new size of the receive window.
 11. A methodaccording to claim 10 wherein the step of setting the time intervalcomprises: in response to determining that the new size of the receivewindow indicates an increase in data rate of the data link, reducing thetime interval.
 12. A method according to claim 10 wherein the step ofsetting the interval comprises: in response to determining that the sizeof the buffer space allocated to the data link is greater than the newsize of the receive window, reducing the time interval.
 13. A methodaccording to claim 10 further comprising a step of varying the size ofthe buffer space allocated to the data link in response to a requestfrom a consumer of the traffic data.
 14. A method according to claim 13wherein the consumer is the application.
 15. A method according to claim10 further comprising sensing presence in a transmit buffer of data tobe transmitted over the data link.
 16. A method according to claim 15wherein the step of setting the time interval comprises reducing thetime interval in response to sensing in a transmit buffer data to betransmitted over the data link.
 17. A method according to claim 10wherein the step of setting the time interval comprises reducing thetime interval in response to sensing that a congestion mode of theprotocol is in operation over the data link.
 18. The method of claim 1,wherein the timer is a periodic timer.
 19. A memory storingnon-transitory machine readable code, the machine readable codeexecutable by a data processor and configured to control processing ofdata in the data processor, the data processor being connectable to afurther device over a data link, the machine readable code executing onthe processor to perform the steps: receiving data at the dataprocessor; starting a timer, said timer running separately to anapplication of the data processor; and in response to the timerindicating that a time interval has passed since the start of the timer,determining whether processing of the received data by the applicationof said data processor in accordance with a data transfer protocol hasbegun, and, in response to determining that the application has notbegun the processing of the received data in accordance with the datatransfer protocol, triggering a protocol processing element of anoperating system of said data processor to perform the processing of thereceived data in accordance with the data transfer protocol; wherein abuffer space having a size is allocated to the data link for storingdata received at the data processor over the data link, the data linkhaving a protocol that employs a receive window, the receive windowhaving a size that is modified to a new size during operation, inaccordance with which a transmitter of data according to the protocolwill transmit no further traffic data once an amount of data defined bythe receive window has been transmitted and is unacknowledged by thedata processor.